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Abstract 

This paper presents different methods for solving parallel machine schedul- 
ing problems with precedence constraints and setup times between the jobs. 
Limited discrepancy search methods mixed with local search principles, domi- 
nance conditions and specific lower bounds are proposed. The proposed meth- 
ods are evaluated on a set of randomly generated instances and compared with 
previous results from the literature and those obtained with an efficient com- 
mercial solver. We conclude that our propositions are quite competitive and 
our results even outperform other approaches in most cases. 
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1 Introduction 



This paper deals with parallel machine scheduling with precedence constraints 
and setup times between the execution of jobs. We consider the optimization of two 
different criteria: the minimization of the sum of completion times and the mini- 
mization of maximum lateness. These two criteria are of great interest in production 
scheduling. The sum of completion times is a criterion that maximizes the produc- 
tion flow and minimizes the work-in-process inventories. Due dates of jobs can be 



associated to the delivery dates of products. Therefore, the minimization of maxi- 
mum lateness is a goal of due date satisfaction in order to disturb as less as possible 
the custo mer who is delivered with the longest delay. These problems are strongly 



NP-hard flGraham et al.l . 119791 ) 



The parallel machine scheduling problem has been widely studied ( ICheng and Sinl . 



1990), specially because it appears as a relaxation of more complex problems like the 



hybrid flow shop scheduling problem or the RCPSP (Resource-Constrained Project 
Sc heduling Problern ) . Sev eral methods have been proposed to solve t his problem. 



In IChen and Powell! ( 1l999l ). a column generation strategy is proposed. iPearn et al. 



(120071 ) propose a linear program and an efficient heuristic for large-size instances for 



Salem et al 



Neronetal 



the re solution of priority constraints and family setup times problem. 
( 2000 ) solve the problem with a tree search method. More recently, 
(120081 ) compare two different branching schemes and several tree search strategies 
for the problem with release dates and tails for the makespan minimization case. 

However, the literature on parallel machine schedul ing with precedence con- 
strain ts and setup times is quite limited. iBaev et al.l (12002! ) and Ivan den Akker et al. 
(120051 ) deal with the problem with precedence constraints for the minimization of 
the sum of completi on times and maximum laten ess resp ectively. The setup times 



up 

case is considered in ISchutten and LeussinkI ( 19961) and in lOvacik and Uzsovl (11995! ) 
for the minimization of maximum lateness. lUzsoy and Velasquez! (120081 ) deal with 
the s ame criterion on a single machine with family-dependent setup times. Fi- 
nally, iNessah et al.l (12005! ) propose a lower bound and a branch-and-bound method 
for the minimization of the sum of completion times. 

Problems that have either precedence constraints or setup times, but not both, 
can be solved by list scheduling algorithms. It means there exists a total ordering of 
the jobs (i.e., a list) t hat, wher i a giv en machine assigment rule is applied, reaches 
the optimal solution (!Schuttenl . !l994 ). For a regular criterion, this rule is callec 
Earliest Completion Time (ECT). It consists in allocating every job to the machine 
that allows it to be completed at the earliest. This reasoning unfortunately does not 
w ork when precedenc e const raints and setup times are considered together, as shown 
in iHurink and Knust! (12001! ). We have then to modify the way to solve the problem 



and consider both scheduling and resource allocation decisions. 

In Section 2, we define formally, the parallel machine scheduling problem with 
setup times and precedence constraints between jobs. In Section 3 we present a 
branch-and-bound method and its components: tree structure, lower bounds, and 
dominance rules. Discrepancy-based tree search methods are described in Section 4. 
In Section 5 we present the hybrid tree-local search methods used to solve large-size 
instances. Section 6 is dedicated to computational experiments. 
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2 Problem definition 



We consider a set J of n jobs to be processed on m parallel machines. The 
precedence relations between the jobs and the setup times, considered when different 
jobs are sequenced on the same machine, must be satisfied. The preemption is 
not allowed, so each job is continually processed during pi time units on the same 
machine. The machine can process no more than one job at a time. The decision 
variables of the problem are the start times of every job i = l..n, Si, and let us define 
Cj as the completion time of job i, where Ci = Si + Pi- Let and di be the release 
date and the due date of job i, respectively. Due dates are only considered for job 
lateness computation. We denote by E the set of precedence constraints between 
jobs. The relation G E, with i,j G J, means that job i is performed before job 
j {i -< i) such that job j can start only after the end of job i {Sj > Ci). Finally, we 
define Sij as the setup time needed when job j is processed immediately after job i 
on the same machine. Thus, for two jobs i and j processed successively on the same 
machine, we have eithe r Sj > d + Sjj i f i pr ecedes j, or Si > Cj + Sji if j precedes 
i. Using the notation of iGraham et al.l (119791 ). the problems under consideration are 
denoted: Pm\prec, Sij,ri\ J2Ci for the minimization of the sum of completion times 
and Pm\prec, Sjj,rj|Lniax for the minimization of the maximum lateness. 



Example 

A set of 5 jobs (n = 5) must be executed on 2 parallel machines (m = 2). For 
every job i, we give Pi, r,, di, and Sij (see Tabled]). Besides, for that example we 
have the precedence constraints: 1 -< 4 and 2 -< 5. 
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Table 1: Example 1 data 



Figure [T] displays a feasible solution for this problem. The set of precedence 
constraints is satisfied: 5*5 = 13 > 3 = C2 and 5*4 = 5 > 5 = Ci. We stress that job 
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4 must postpone its start time on M2 by one time unit because of the precedence 
constraint. On the other hand, we have to check that, for every job z, < 5*^ and 
that setup times between two sequenced jobs on the same machine are also respected. 
For the evaluation of the solution, we observe that for the minimization of the sum of 
completion times the value of the function is z = Y^Ci = 43) and for the minimization 
of maximum lateness z = Lmax = -^^5 = 10. 



n Sl3 S35 





1 




3 




5 














2 




4 






S24 





1 3 5 8 12 13 15 



Figure 1: Feasible schedule 



3 Branch-and-Bound components for Pm\prec^ Sij^ Vi 
and Pm\prec, s^j, r^\L^^^ 

A tree structure with two levels of decisions (scheduling and resource allocation) 
is defined in Section 13. 1[ Lower bounds, constraint propagation mechanisms and 
dominance rules are introduced in Sections 13.21 and 13. 3[ 

3.1 Tree structure 

Precedence constraints and setup times s cheduling problems may n ot be efficiently 



solved by a list algorithm as conjectured by lHurink and KnustI (120011 ). It means that 
there possibly does not exist a job allocation rule that reaches an optimal solution 
where all the possible lists of jobs are enumerated. Let us consider the minimization 
of the sum of completion times for 4 jobs scheduled on 2 parallel machines. The data 
of the problem are displayed in Table 2. 



4 



(a) 

n Pi Tj 
110 

2 1 

3 1 2 

4 1 2 



(b) 

s,j 12 3 4 

1 To 2 To 

2 10 1 1 

3 10 10 10 

4 10 10 10 



Table 2: Example 2 data 



If we consider the problem without precedence constraints, we find two optimal 
solutions {Y^Ci = 9) when we allocate the jobs following the Earliest Completion 
Time rule for the lists {1, 2, 4, 3} and {2, 1, 4, 3} (see FigureEti). Now, let us consider 
the same problem with the precedence constraint 3 -< 4. In that case, there does not 
exist any allocation rule that reaches an optimal solution for any list of jobs that 
respects the precedence constraint. The optimal solution (J^Ci = 11) is reached 
when we consider the list {1, 2, 3, 4} and job 3 is not allocated on the machine that 
allows it to finish first (see Figure [2b). Thus, in our problems we have not only to 
find the best list of jobs but also to specify the best resource allocation. 
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(a) Optimal schedule without the prece- (b) Optimal schedule with the precedence 
dence constraint constraint 

Figure 2: Example of job allocation 
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The optimal solution can be reached by a two decision-level tree search. We define 
a node as a partial schedule a{p) of p jobs. Every node entails at most m x [n — p) 
child nodes. The term n — p corresponds to the choice of the next job to be scheduled 
(job scheduling problem). Only the jobs with all the previous jobs already executed 
are candidates to be scheduled. Once the next job to be scheduled is selected we have 
to consider the m possible machine allocations (machine allocation problem). For 
practical purposes, we have mixed both levels of decision: one branch is associated 
with the choice of the next job to schedule and also with the choice of the machine. A 
solution is reached when the node represents a complete schedule, that means when 
p = n. 



3.2 Node evaluation 

Node evaluation differs depending on the studied criterion. First, we propose to 
compute a simple lower bound. For every node (partial schedule), we update the 
earliest start times of the unscheduled jobs taking account of the branching decisions 
through precedence constraints and we calculate the minimum completion time (for 
min J2 Ci criterion) and the minimum lateness (for min Lmax criterion) for every not 
yet-scheduled job. Then we update the criterion and we compare the lower bound 
with the best current solution. 

We propose to compute an upper bound. The upper bound is computed by a 
simple list scheduling heuristic selecting the combination of job, between the not 
yet-scheduled jobs, and machine with the shortest start time. 

For criterion min Cj, we also propose to compute the lower bound presented 



m 



Nessah et al.l (120051 ) for the parallel machine scheduling problem, with sequence- 
dependent setup times and release dates (Pm|sjj,rj| J^Ci). This problem is a relax- 
ation of the problem with precedence constraints, so the lower bound is still valid for 
our problem. In this paper, we just present the lower bound for the proble m, that 



is based on job preemption relaxation, and we refer to iNessah et al.l (120051 ) for the 
proof. 

Let S^: be the schedule obtained with the SRPT (Shortest Remaining Process- 
ing Time) rule for the relaxed problem l|rj, + s*),pmtn| J2^^^{C* — s*,ri + pi), 
where Sj = min^yj and s* = ^. Let C[*](S'*) be the modified completion time 
of job i with the processing time Pi + s* for each job i. Let = Pi + ri + s* 
and let (a[i], ap], . . . , a[„]) be the series obtained by sorting (ai, 02, ... , a„) in non- 
decreasing order. Then LB = X] niax[C[*](5'*), a[j]] — is a lower bound for 
Pm\prec, Sij,ri\J2Ci. The complexity of the lower bound is 0(?7,logn), the same 
complexity as SRPT. 
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For minLmax, the evaluation consists in triggering a satisfiability test base d 



on constraint propagation involving energetic reasoning (iLopez and Esquiroll . Il996l ). 
The energy is produced by the resources and it is consumed by the jobs. We ap- 
ply this feasibility test to verify whether the best solution reached from the current 
node will be at least as good as the best current solution. We determine the mini- 
mum energy consumed by the jobs {Econsumed) over a time interval A = [^1,^2] and 
we compare it with the available energy {Eproduced = m x (t2 — ti)) . In our prob- 
lem we also have to consider the energy consumed by the setup times {Esetup)- If 
Econsumed + Egetup > Eproduced we cau pruuc the uodc. 

For an interval A where there is a set F oik jobs that may consume energy, we can 
easily show that the minimum quantity of setups which occurs is a = max(0, k — m). 
So, we have to take the a shortest setup times of the set {sij}, i,j G F, into account. 

The energy consumed in an interval A is Econsumed = J2i niax(0, min(pj, ^2—^15''^^ + 
Pi — ti, t2 — d[ + S[z] where s\[\ are the setup times of the set {sj-,}, i, j e F, 

sorted in non-decreasing order, and a time window [r^', c?-] for every not yet-scheduled 
job i is issued from precedence constraint propagation: 



• = maxjrj, Vj + pj; V j G } and d[ = min{Zbest + di, d'j - pj] V j G F+}, 

where F~ and F^ are respectively the set of previous and successor jobs for job i and 
Zfjcst is the minimum current value for Lmax- 

In Figure [3] we illustrate how to compute the energy consumed by the not yet- 
scheduled jobs (1 to 5 in the example) for a 3-machine problem. For every job, 
we determine a time window and the minimum energy consumed (in grey) over the 
selected interval A = [^1,^2]- For E setup we have to take the a shortest setup times, 
in the example /c = 4 (there is no consumption for job 1) and m = 3, so we have to 
sum only the shortest setup time between the consuming jobs, in our case we add 2 
energy units (value of S35). 
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Figure 3: Minimum energy consumed in a partial schedule 



The time interval A = [ti, considered to compute the energy consumed is 
tl = minr-,Vz G F and t2 = d'j, where j is the job with the shortest time window 



min {d'j — r'j),\/j G F. The complexity of the energetic test is 0(n^ 



3.3 Dominance rules 

We also propose dominance rules to restrict the search space. They consist in trying 
to find whether there exists a dominant node allowing us to prune the evaluated 
node. All proposed rules are based on the dominance properties of the set of active 
schedules. A schedule S is active if no feasible schedule can be obtained from S by 
left-shifting a single activity. Let us define the front of a partial schedule as the set 
of the last jobs executed on the machines (the ones with the largest start times). 

We first present a global dominance rule based on max fiow computation based on 
a resource-fiow model pre viously used for the reso urce-constrained project scheduling 



problem with setup times (INeumann et al.l (120021 ). Section 2.13). The idea is to verify 



that there exists a partial schedule o"'(p) with the start times 
S' = {S[, S2, ■ ■ ■ , S^, . . . , S'p} different from cr(p) with start times 
S = {Si, S2, . . . , Si, . . . , Sp} that allows us to move forward the start time of job 
k without modifying other start times {S'^ = Si,Wi 7^ k and S'f. < Sk — I). This is 
a necessary but not a sufficient condition for the dominance. Besides, the schedule 
cr'(p) has to keep the same front as a{p) except for the case where job k does not 
belong to the front of cr'(p) (the dominant partial schedule). For example in Fig- 
ure m job 5 (5*5 = 18) may be scheduled after job 4 or between job 2 and job 4 
with a shortest start time {S'^ = 17). In the first case the new schedule cr'(p) is not 
dominant because of setup times but in the second case it is, so the front can be 
modified only if job k is not part of it in a'{p). 
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We represent cr'(p) by a graph and we turn the dominance rule in a max flow 
computation. Two vertices are considered for every job, the first one represents 
the start time it and the second one the completion time ig of the job. One unit 
capacity arcs are defined between the vertices ig-jt by the partial schedule cr'(p) and 
they represent the transfer of resource units between the jobs. Finally, we need four 
dummy vertices. Two vertices (0^, Ot), the source node S and the sink node T, flow 
origin and flow destination, respectively. Arcs S-Og and Ot-T have m-unit capacity 
and represent the resource constraint. 1-unit capacity arcs between S-ig and it-T 
ensure the job execution. 
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Figure 4: Partial schedule of the evaluated node 



Figure [5] shows the flow network for the schedule depicted in Figure H] (data of 
Table [1]). For each node we try to find a schedule that allows us to move forward 
the start time of the last scheduled job by one unit (job 5 in the example, S'^ = 17) 
and to keep the same start times for the other jobs. We create a direct arc ig-jt if 
Sj > S'^+Pi + Sij, that means if job j can be executed on the same machine than job 
i. In order to respect the second condition for the dominance, we do not create the 
arcs between the jobs belonging to the front in the evaluated node (job 4 and job 5). 
We observe that a max flow of m + p units is necessary to ensure all job executions 
and to satisfy the resource constraints. In that case, cr'{p) is a feasible schedule and 
we can prune the node. 
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Figure 5: Network to compute the max flow dominance rule 



We propose a second dominance rule based on the position of the front jobs in the 
priority list. For a given schedule, the dominance rule searches for a new list of jobs 
in order to obtain the dominant partial schedule. We modify the list of scheduled 
jobs taking into account the precedence constraints. We can prune the evaluated 
node when the dominant partial schedule keeps the same front than the evaluated 
node (jobs 1, 2, and 3), one of the jobs starts earlier {S[ < Si) and for the rest of 
jobs belonging to the front the start times are not delayed (5*2 = S'2 and 5*3 = S'3), 
as we see in Figure O 
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Figure 6: Example of dominant partial schedule 



We propose to permute the order of the m front jobs in order to find the dominant 
schedule. For example, in Figure [6] if the order of scheduled front jobs is 1 — 2 — 3 we 
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test all the possible permutations satisfying precedence constraints. If one of such 
permutations yields a dominant partial schedule, we can prune the evaluated node. 
This rule can be computed with time complexity 0{m\). As shown in Section [5], 
despite its exponential worst-case complexity, this dominance rule has interesting 
properties when used in conjunction with discrepancy-based tree search and remains 
efficient for a small number of machines. A partial enumeration remains valid if m 
becomes very large. 

Note similar dominance rules have already been used for the RCPSP (which 
can be defined as an extension of the parallel machine scheduling problem with 

preced ence constraints, but without setup tim es) under th e name "cutset dominance 

rules" (jPemeulemeester and Herroelenl . 119971 ). However, in lDemeulemeester and Herroelen 
(119971 ). all the cutsets are kept in memory yielding important memory requirements. 



4 Discrepancy-based tree search methods 

4.1 Limited discrepancy search 

To tackle the combinatorial explosion of the standard branch-and-bound methods 
for large problem instances, we use a method based on the discrepancies regarding 
a reference branching heuristic. Such a method is based on the assumed good per- 
formance of this reference heuristic, thus making an ordered local search around the 
solution given by the heuristic. First, it explores the solutions with few discrepancies 
from the heuristic solution and then it moves away from this solution until it has 
covered the whole search space. In this c ontex t, the principle of LDS (Limited Dis- 



crepancy Search) (IHarvey and Ginsberg) . Il995l ) is to explore first the solutions with 



discrepancies on top of the tree, since it assumes that the early mistakes, where very 
few decisions have been taken, are the most important. 

Figure [7] shows LDS behavior for a binary tree search with the number of dis- 
crepancies for every node. Let us consider the left branch as the reference heuristic 
decision. At iteration we explore the heuristic solution, then at iteration 1 we 
explore all the solutions that differ at most once from the heuristic solution, and we 
continue until all the leaves have been explored. 

LDS can be used as an exact method, for small-size instances, when the maximum 
number of discrepancies is authorized. We can also use it as an approximate method 
if we limit the number of authorized discrepancies. 

Several metho ds based on LPS have been proposed to improve its efficiency. ILDS 



(Improved LDS) (IKorfl . Il996l ) has been devised to avoid the redundancy (observed 



in Figure [7j) where the solutions with no discrepancies are also visited at iteration 1. 
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Figure 7: Limited Discrepancy Search for a binary tree 



DDS (Depth- bounded Discrep ancy Search) (IWalshl. ll997D or DBDFS (Discrepancy- 
Bounded Depth First Search) ( Beck and Perron . 2000l ) propose to change the order of 
the search. DDS hmits the depth where the discrepancies are considered, in the sense 
that at the k^^ iteration we only authorize the discrepancies at the first k levels of the 
tree. It stresses the principle that the early mistakes are the most important. DBDFS 
consists in a classical DFS where t he nodes explored ar e limited by the discrepancies. 



Recently, in the YIELDS method (IKaroui et al.l . 120071 ). learning process notions are 
integrated. In what follows, we propose several versions of LDS adapted to the 
considered parallel machine scheduling context. 



4.2 Exploration strategy 

As a branching heuristic, we use the same heuristic to compute the lower bound 
presented in Section [3l2} EST (Earliest Start Time) rule for the selection of the next 
job to schedule and the resource to execute it. We take criterion EST because it 
is intuitively compatible with the minimization of setup ti mes which has globall y 



a positive impact for minimization of other regular criteria (lArtigues et al.l . l2005l ). 

In case of tie between two jobs, we apply SPT (Smallest Processing Time) rule for 
minX]C*i and FDD (Earliest Due Date) for minLmax- 

Because of the existence of two types of decisions, we consider here two types of 
discrepancies: discrepancy on job selection and discrepancy on resource allocation. 
In the case of non-binary search trees, we have two different ways to count the 
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discrepancies (see Figure [H]). In the first mode (binary), we consider that choosing 
the heuristic decision corresponds to discrepancy, while any other value corresponds 
to 1 discrepancy. The other mode (non-binary) consists in considering that the 
further we are from the heuristic choice the more discrepancies we have to count. 
We suggest to evaluate experimentally both modes for the heuristic for job selection. 
On the other hand, for the choice of the machine, we use the non-binary mode since 
we assume that the allocation heuristic only makes a few errors. As we will see in 
Section [5], selecting the machine which allows the earliest completion of the job is a 
high performance heuristic. 



Heuiisb'clistof job; 



Heuristic list of job; 
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(a) binary (b) non-binary 

Figure 8: Example of discrepancies counting modes on job selection 



We p ropose to test three d ifferent branching schemes. The first one, called 
DBDFS fiBeck and Perronl . l2000f ). is a classical depth-first search where the solutions 
obtained are limited by the allowed discrepancies (see Section HT]) . We propose two 
other strategies, LDS-top and LDS-low, which consider the number of discrepancies 
for the order in which the solutions are reached. The node to explore is the node 
with the smallest number of discrepancies, and with the smallest depth for the strat- 
egy called LDS-top, and with the largest depth for the strategy called LDS-low. As 
Figure [9] shows (case of 2 authorized discrepancies) all three methods explore the 
same solutions but in different orders. 
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LDS-top 1 5 4 11 3 
LDS-low 1 2 3 6 4 
DBDFS 12 3 4 5 

Figure 9: Order of explored 
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4.3 Large neighborhood search based on LDS 



We have presented LDS as an exact or a truncated tree search method. In this 
section, we propose to use it as part of local search. In a local search method, we 
define a solution neighborhood Nk{x) {k defines the acceptable variations of solution 
x). If we find a solution x' better than x in Nk{x) then we explore the neighborhood 
Nk{x') of this new best solution. In the case of large-scale neighborhoods problems, 
the neighborhood becomes so huge that we can consider the search for the best 
solution in Nk{x) as an optimization sub-problem (jShawl . Il998l ). In that context, we 
consider a neighborhood defined by an L PS search tree. 

CDS (Climbing Discrepancy Search) f Milano and Roli 20021 ) is the first large 
neighborhood search method based on LDS (see Algorithm [1]) . At each iteration it 
carries out a ^-discrepancy search around the best current solution. If a better solu- 
tion is found, then CDS explores its neighborhood. In the case of no better solution is 
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found, then k is increased by one. 



Algorithm 1: Climbing Discrepancy Search 
begin 

Solref Initial HeuristicQ; 
while k < kmax do 

/* Generate the set of solutions N of k discrepancies fr 

= LD S [Solref, k)- 
s' ^ BestOf{N)- 
if z{s') < z{Solref) then 

Solref ^ — S ] 

k^l; 
else 

\_ k^k + 1] 
end 



The d rawback of CDS is that for large-size instances the neighborhood quickly 
explodes. iHmida et all (120071 ) propose CDDS (Climbing Depth-bounded Discrepancy 
Search) that mixes principles of CDS and of DDS. The neighborhood of the best 
solution is limited not only by the number of discrepancies but also by the depth in 
the tree. In that case, the neighborhood explosion is avoided and the idea that the 
most important heuristic mistakes are early ones is stressed. 

In this work, we propose two variants of CDS and CDDS for the problems at hand. 

They are closely related with VNS (Variable Neighborhood Search) flHansen and Mladenovid . 
200 ll ) concept, since we modify the size and the structure of the neighborhood ex- 
plored. HD-CDDS (Hybrid Discrepancy CDDS) (see Algorithm [2]) consists in a mix 
of CDS and CDDS. We start with a CDS search, but if for a defined number of dis- 
crepancies kiimit we cannot find a better solution, then we authorize a bigger number 
of discrepancies only between some levels {[dmin,dmax]) ■ Once we have finished the 
search for kumit + 1, we propose either to increase the number of authorized discrep- 
ancies and to keep the same number of levels where the discrepancies are authorized 
(x = dmax—dmin), which is the case in Algorithm[2l or to increase the number of levels 
and to keep the number of discrepancies. This method solves the problem of neigh- 
borhood explosion and offers more jobs mobility than CDDS (which is particularly 
interesting for setup times problems) but we need to parametrize the values of the 
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Algorithm 2: Algorithm HD-CDDS 



search {kumit, x). 



begin 

k i- 

dm.i 



0; 



Solref ^ InitialHeuristici)] 

while termination conditions not met do 

/* Generate the set of solutions N of k discrepancies from Solref */ 

N = GenSol{Solref, k, dmin, dmax); 

s' ^ BestOf{N)- 
if z{s') < z{Solref) then 

Solref * 

k^l: 



s': 



dmin 0, 
dmax ^ ^1 

else 

if A; < kiimit then 
I k ^k + 1; 
else 



if d 



max 
dmin 
dmax 

else 

d, 

d; 

if dmin 
k^ 



"mm 
j 

''max 



dmin = n then 
-0; 

- X] 

~ dmax ) 
~ dmin ~\~ X, 

> n then 

k + l- 
0; 



mm 
max ^ X 



end 



The second proposed variant, MC-CDS (Mix Counting CDS), is an application 
of CDS but with a modification in the way to count the discrepancies for the job 
selection rule only. We consider a binary counting for the discrepancies at the top 
level of the tree and a non-binary counting way for the rest of levels. This variant 
accepts discrepancies for all depth levels because the non-binary counting restricts 
the explored neighborhood. 
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4.4 Discrepancy-adapted dominance rules 

In this section we propose to adapt the second dominance rule presented in Sec- 
tion 13.31 to the principle of local search. We argue that it can be very inefficient to 
use the dominance rule as presented in Section 13.31 with the proposed local search 
methods. Indeed, the best solutions of the neighborhood could not be explored be- 
cause we have found a dominant partial schedule that allows us to prune them. Even 
if it is true that there exists a solution better than the evaluated node, it may not 
belong to the explored neighborhood. 

For that reason, we propose discrepancy-adapted dominance rules. Once we know 
the criterion that defines the neighborhood (for example, k authorized discrepancies 
from the job list L) , we only have to verify that the new list of jobs L' that reaches the 
dominant partial schedule is part of the explored nodes in the local search [L' e G, 
where G is the set of /^-discrepancies lists from L). 

We can see that the max flow computation rule presented in Section 13.31 is not 
discrepancy adaptable. It is not possible to verify that the dominant partial schedule 
(t'{p) is part of the explored space because the rule indicates the existence of cr'{p) 
but not the corresponding schedule. On the other hand, the second dominance rule 
introduced in Section 13.31 consists in a local modification of the evaluated schedule 
in order to explicitly obtain the dominant schedule. That way, we have the list of 
jobs, L', available to compare with the best current solution list of jobs, L, and to 
verify that the dominant schedule is part of the explored nodes. Hence, when the 
encountered dominant schedule is not part of the explored neighborhood the current 
node is not pruned. 

5 Computational experiments 

In this section we present the main results obtained from the implementation of our 
work. In the literature we have not found instances for parallel machines including 
both setup times and precedence constraints. Therefore, we propose to test the 
methods on a set of randomly generated instances. The algorithms are implemented 
in C-|--|- and were run on a 2 GHz personal computer with 2 Go of RAM under the 
Linux Fedora 8 operating system. 

We generate a set of 120 (60 for each criterion) small-size instances {n = 10, 
m = 3, and n = 15, m = 2) for the evaluation of the dominance rules and for the ECT 
rule efficiency. Then, we test on a set of 120 middle-size instances (n = 40, m G [2, 4]) 
the different branching rules [LDS-top, LDS-low, and DBDFS), the different ways to 
count the discrepancies {binary and non-binary) to determine the best methods for 
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being included inside the LDS structure of the local search methods. The efficiency 
of the lower bounds, the dominance rules and the energetic reasoning proposed in 
Section [3] are tested on middle and large-size instances {n = 100, m G [2,4]). We 
also compar e the CDS and the HD-CDDS methods with the results obtained in 



Neron et al.l (120081 ) for the hard instances of the -P?7i|rj, gj|Cmax problem (without 



precedence constraints and setup times). And finally, we evaluate and compare the 
proposed methods on a set of 120 large-size instances with the results obtained with 

ILOG OPL 6.0. 

We use the RanGen software (jPemeulemeester et al.l . |2003| ) in order to generate 
the precedence graph between the jobs. Setup times and time windows [rj, di] cannot 
be generated by RanGen. Setup times are generated from the uniform distributions 
f/[l, 10] and f/[20,40] . Moreover they must respect the weak triangle inequality: 
Sij < Sik + Pk + Skj,'^i,j, k. The values of pi are generated from the uniform dis- 
tribution U\l, 5] . Tira e windows are generated in a classical way we found in the 
literature f Sourdl . 2005 ). The values of di are generated from the uniform distribu- 
tion f/[max(0, P X (1 - r - p/2)), P x (1 - r + p/2)], where P = J2{Pi + minj(sij)), 
r G [0, 1], p G [0, 1]. The are generated from di, Vi = di — {pi x (2 + a)) where 
a G [-0.5, +1.5]. 

We solve to optimality the small-size instances and we compare the results ( Opti- 
mal) with the results obtained when we apply the ECT rule (ECT) for each possible 
list of jobs (jobs are only allocated to the machine which allows to finish it ffist), with 
the results using the dominance rule based on the permutation of front jobs [Front 
Rule), and with the results using the dominance rule based on max flow computation 
(Max Flow). 
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60 Instances 








n = 10, m = 3 


NbBest 


AvgNodes 


AvgTCPU 


Optimal 


60 (100.0 '/.) 


484925 


10.6 


Front Rule 


60 (100.0 Z) 


480444 


12.3 


Max Flow 


60 (100.0 Z) 


339541 


27.7 


EOT 


53 (88.3 Z) 


61684 


0.07 


60 Instances 








n = 15, m = 2 


NbBest 


AvgNodes 


AvgTCPU 


Optimal 


60 (100.0 Z) 


10126793 


641.9 


Front Rule 


60 (100.0 Z) 


9480313 


626.4 


Max Flow 


60 (100.0 Z) 


7530154 


454.6 


EOT 


54 (90.0 Z) 


1747416 


2.5 



Table 3: Results of ECT and dominance rules efficiency for minX^C'i problem 



60 Instances 








n = 10, m = 3 


NbBest 


AvgNodes 


AvgTCPU 


Optimal 


60 (100.0 Z) 


281896 


5.6 


Front Rule 


60 (100.0 Z) 


263474 


7.9 


Max Flow 


60 (100.0 Z) 


219557 


19.7 


ECT 


52 (86.7 Z) 


69141 


0.07 


60 Instances 








n = 15, m = 2 


NbBest 


AvgNodes 


AvgTCPU 


Optimal 


60 (100.0 Z) 


11936385 


884.8 


Front Rule 


60 (100.0 Z) 


10503767 


778.7 


Max Flow 


60 (100.0 Z) 


8945948 


628.4 


ECT 


54 (90.0 Z) 


4681104 


7.27 



Table 4: Results of ECT and dominance rules efficiency for min Lmax problem 
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First, note that we found some hard instances that we could not to solve to 
optimality before 15000 seconds. We observe in Tables [3] and H] that ECT rule is 
very efficient for both problems. The optimal solution is reached over almost 90 '/. of 
the instances and the average CPU time (AvgTCPU) is clearly reduced when we 
use the ECT rule. These results let us consider, for local search methods, only 
the job permutation allocating the jobs on the machines following the ECT rule. 
The dominance front rule is also effective, the average number of explored nodes 
{AvgNodes) and the average CPU time usually decrease when we use it. We observe 
that the Max Flow rule largely reduces the number of explored nodes and the CPU 
time, except for the very small-size instances. We deduce that it is a very efficient 
rule to solve to optimality instances with a larger number of jobs. 

In the comparison between the two different ways to count the discrepancies, 
binary and non-binary (only for job selection rules), we have evaluated on the middle- 
size instances the number of times each mode has found the best solution (NbBest). 
The CPU time is limited to 100 seconds. 

Table [5] shows that the binary mode has a higher performance than the non-binary 
one. Out of a set of 120 instances, the binary mode has found the best solution over 
75 '/. of the instances, independently of the branching rule. We find very similar 
results for both criteria. In the following, the binary counting is kept for the LDS 
structure of the local search. 

120 Instances NbBest 

n = 40,m g [2,4] binary mode non-binary mode 

DBDFS 90 (75.0 7.) 48 (40.0 7.) 

LDS-top 93 (77.5 7.) 49 (40.8 7.) 

LDS-low 98 (81.7 7.) 31 (25.8 7.) 

Table 5: Results of the comparison between discrepancies counting modes 

In Table |6l we can see the results for the comparison between the exploration 
strategies. In addition to previous notations, we introduce the average mean devi- 
ation from the best solution (AvgDev). The CPU time is limited to 100 seconds. 
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Binary mode minX^Cj (60 instances) minLmax (60 instances) 
n = 40, m e [2, 4] NhBest AvgDev NbBest AvgDev 

DBDFS 43 (71.7 7.) (mT 47 (78.3 Z) L86Z 

LDS-top 29 (48.3 Z) 0.43 Z 17 (28.3 Z) 2.33 Z 

LDS-low 50 (83.3 Z) 0.71 Z 59 (98.3 Z) 0.75 Z 

Table 6: Results for the comparison of different branching strategies 



We find that LDS-low is the most efficient strategy, since it reaches the best 
solution for a larger number of instances and it presents the less important average 
mean deviation when the best solution is found by another strategy. LDS-low finds 
the best solution for all instances except for one corresponding to the maximum 
lateness minimization and for 50 over a set of 60 instances for completion times sum 
minimization. We use this strategy for the remaining computational experiments. 

The lower bounds, the energetic reasoning, and the discrepancy-adapted domi- 
nance rule are compared in Tables [7] and [H We run a 30 seconds LDS search for the 
middle and large-size instances for different versions of the node evaluation. First, 
we only consider the lower bound computed using precedence co nstraint propagation 
(LBCP), then we add the lower bound {LBj^cy) proposed in iNessah et al.1 (120051 ) 
for minX)C*i problem and the energetic reasoning (ENERGY) for minLmax prob- 
lem; finally we add the discrepancy-adapted dominance rule (DaDR). We compare 
the number of times each version finds the best solution (NbBest) , the explored nodes 
average (AvgNodes), and the average CPU time needed to reach the best solution 
(TBest), only for the cases that all versions have found it. 
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60 Instances 



n = 40,m G [2,4] 


NbBest 


AvgNodes 


TBest 


LBCP 


36 (60.0 Z) 


62007 


4.52 


LBncy 


38 (63.3 Z) 


61742 


4.47 


DaDR 


35 (58.3 Z) 


53373 


1.69 


60 Instances 








n = 100, m e [2, 4] 


NbBest 


AvgNodes 


TBest 


LBCP 


26 (43.3 Z) 


9259 


17.55 


LBncy 


34 (56.7 Z) 


7813 


15.63 


DaDR 


38 (63.3 Z) 


7606 


8.71 



Table 7: Results of lower bounds and dominance rule efficiency for min Ci problem 



Tables [7] and [S] show the efficiency of the specific lower bound LBlqy and energetic 
reasoning with the computation of setup times consumption. Moreover, we find that 
the discrepancy-adapted dominance rule is very efficient for large-size instances but 
not especially interesting for the middle-size instances. However the time consumed 
to reach the best solution is reduced when we use the dominance rule for most of 
cases. 

60 Instances 



n = 40,m G [2,4] 


NbBest 


AvgNodes 


TBest 


LBCP 


47 (78.3 Z) 


93737 


4.81 


ENERCY 


48 (80.0 Z) 


99856 


4.24 


DaDR 


44 (73.3 Z) 


71737 


4.59 


60 Instances 








n = 100, m G [2,4] 


NbBest 


AvgNodes 


TBest 


LBCP 


44 (73.3 Z) 


11474 


4.29 


ENERGY 


48 (80.0 Z) 


12961 


3.58 


DaDR 


55 (91.7 Z) 


9462 


3.17 



Table 8: Results of lower bound, energetic reasoning and dominance rule efficiency 
for min Lmax problem 



We comp are CDS an d HD- CDD S methods a g ainst other tree search methods 



presented in iNeron et al.l (120081 ) . In iNeron et al.l (l2008l ) , the authors test two dif- 
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ferent branching schemes, time windows (tw) and chronological (chr), and several 
incomplete tree search techniques (truncated branch-and-bound, LDS, Beam Search 
and Branch-and-Greed) for the -Pm|rj, gj|Cmax problem. We adapt the proposed 
methods for this problem and we use the heuristic for the initial solution and the 
upper bounds proposed in their paper. In Tabled we compare LDS {z is the number 
of authorized discrepancies) and Beam Search {BS, u is the number of explored child 
nodes) results, the method with the best results in their work, against the proposed 
methods CDS and HD-CDDS. We have evaluated the number of times the method 
has found the best solution (NbBest) and for how many of them the method is the 
only one to reach the best solution (NbBestStrict) for a set of 50 ha rd instances 



(n = 100 and m = 10). The CPU time is limited to 30 seconds as in iNeron et al. 



mm . 



50 instances NbBest NbBestStrict 

LDSlZi 1 (2.0 '/.) 

7 (14.0 '/.) 

55^13 25 (50.0 7.) 3 

^S^i^4 22 (44.0 Z) 

~CDS 35 (70.0 Z) 6 

HD-CDDS 38 (76.0 Z) 9 

Table 9: Results for the comparison with other truncated tree search techniques 



Although precedence constraints and setup times are not considered in the prob- 
lem, we can observe that our propositions are strictly better. Out of a set of 50 
instances, CDS and HD-CDDS find the best solution for most of the cases and they 
find a new best solution for 6 and 9 instances respectively. Rather than contradicting 
th e statement of relat ive LDS inefficiency for parallel machine problem experienced 
by lNeron et al.l (120081 ). this demonstrates, at least for this problem, the efficiency of 
large neighborhood search based on LDS. 

Finally, we compare the local search methods with the results obtained by ILOG 
OPL 6.0. The four variants of the hybrid tree local search methods {CDS, CDDS, HD- 
CDDS, MC-CDS) are implemented with LDS-low, discrepancy-adapted dominance 
rule and binary counting (except for MC-CDS which supposes a mix counting). We 
solve the large-size instances [n = 100, m G [2,4]) for two different CPU time limits, 
30 and 300 seconds, then we compare the number of times when the best solution 
has been found by the method and the average deviation from the best solution. 
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J- \^ 1 \j — 




p~U[l,5],Sij ~ C/[l, 10] 


NbBest 


AvgDev 


NbBest 


AvgDev 




17 (56.6 7.) 


0.64 Z 


7 (23.3 Z) 


0.51 Z 


CDDS 


7 (23.3 7.) 


0.75 Z 


7 (23.3 Z) 


0.82 Z 


HD-CDDS 


16 (53.3 7.) 


0.60 Z 


14 (46.7 Z) 


0.43 Z 


MC-CDS 


17 (56.6 Z) 


0.64 Z 


10 (33.3 Z) 


0.45 Z 


ILOG OPL 


4 (13.3 7.) 


1.51 Z 


2 (6.7 Z) 


1.47 Z 


OM • J- 

c5U instances 


TCPU 


= 30s 


TCPU = 


= 300s 


p ~ {/[l,5],Sij ~ f/[20,40] 


NbBest 


AvgDev 


NbBest 


AvgDev 




9 (30.0 7) 


0.23 Z 


6 (20.0 Z) 


0.18 Z 




7 (23.3 Z) 


0.35 Z 


6 (20.0 Z) 


0.38 Z 


HD-CDDS 


12 (40.0 7.) 


0.26 Z 


11 (36.6 Z) 


0.17 Z 


MC-CDS 


11 (36.7 Z) 


0.25 Z 


13 (43.3 Z) 


0.26 Z 


ILOG OPL 


10 (33.3 Z) 


0.70 Z 


5 (16.6 Z) 


0.63 Z 



Table 10: Results for the comparison of different variants of hybrid tree local search 
methods for min Ci problem 



In Table [101 we observe that hybrid local search methods improve the best so- 
lutions found by ILOG OPL. All methods, except CDDS, find the best solution for 
a large number of instances and the mean deviation from the best solution are less 
important than ILOG OPL solutions. We observe that computing an upper bound 
highly increases the efficiency of the truncated search. 
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30 instances TCPU = 30s TCPU = 300s 



P'^U[l,5],Sij ~ C/[l,10] 


NbBest 


AvgD 


ev 


NbBest 


AvgDev 


CDS 


10 (33.3 7.) 


2.75 


7 


7 (23.3 Z) 


3.06 Z 


CDDS 


9 (30.0 7.) 


2.65 


7 


8 (26.7 Z) 


3.28 Z 


HD-CDDS 


13 (43.3 Z) 


1.92 


7 


10 (33.3 Z) 


2.56 Z 


MC-CDS 


13 (43.3 Z) 


1.75 


7 


11 (30.0 Z) 


2.29 Z 


ILOG OPL 


15 (50.0 Z) 


2.07 


7 


18 (60.0 Z) 


1.55 Z 


30 instances 


TCPU 


= 30s 




TCPC/ = 


300s 


p ~ U[l,S\,Sij ~ f/[20,40] 


NbBest 


AvgDev 


NbBest 


AvgDev 




3 (10.0 Z) 


2.76 


7 


2 (6.0 Z) 


2.89 Z 


CDDS 


3 (10.0 Z) 


2.71 


7 


2 (6.0 Z) 


2.88 Z 


HD-CDDS 


13 (43.3 Z) 


2.12 


7 


7 (23.3 Z) 


1.55 Z 


MC-CDS 


12 (40.0 Z) 


2.08 


7 


8 (26.7 Z) 


1.83 Z 


ILOC OPL 


15 (50.0 Z) 


0.91 


7 


19 (63.3 Z) 


0.90 Z 



Table 11: Results for the comparison of different variants of hybrid tree local search 
methods for min Lmax problem 



Table [H] shows the results for the minimization of maximum lateness. For this 
case, we observe ILOG OPL improves our results, but we can say that the proposed 
methods are still competitive, the mean deviation is acceptable and they found the 
best solution over 50 Z and 37 Z of instances, for 30 and 300 seconds respectively. 



6 Conclusion 

In this paper we have studied limited discrepancy-based search methods. We have 
compared and tested some of the existing options for different LDS components, 
such as discrepancy counting modes and branching structures, to solve the parallel 
machine scheduling problem with precedence constraints and setup times. 

New local search methods based on LDS have been proposed and compared with 
similar existing methods. The computational experiments show these methods are 
efficient to solve parallel machine scheduling problems in general and demonstrates 
the interest, at least for the studied problem, o f incorporating LPS in to a large 



neighborhood search scheme as first suggested by iMilano and Rolil (120021 ). 

We have suggested an energetic reasoning scheme integrating setup times and we 
have proposed new global and local dominance rules adapted to discrepancies. As 
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the results show, these evaluation techniques allow to reduce the number of explored 
nodes and the time of the search. 

As a direction for further research, the proposed methods could be extended to 
solve more complex problems involving setup times, like the hybrid flow shop or the 
RCPSP. 
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